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SEQUENCE LISTING 

<110> Lassen, Soren Flensted 

<120> improved fusion proteases and methods for producing them 

<130> 10495. 204-WO 

<160> 53 

<170> Patentln version 3.2 

<210> 1 

<211> 1062 

<212> DNA 

<213> Nocardiopsis sp. nrrl 18262 
<220> 

<2 2 1> mi s c_f eatu re 

<222> (1) . . (4955 

<223> wlw^ es • P r °- re 9 ion shown in positions -165 to -1 of seq id 
<220> 

<221> misc_feature 

<222> C496) . . C1059) 

<223> w!w? eS the mature region shown in positions 1-188 of seq id 



<400> 1 












gctactggag 


cattacctca 


gtctcctaca 


cctgaagcag 


atgcagtatc gatgcaagaa 


60 


gcattacaac 


gtgatcttga 


tcttacatca 


gctgaagctg 


aggaattact tgctgcacaa 


120 


gatacagcct 


ttgaagttga 


tgaagctgcc 


gctgaagcag 


ctggtgatgc atatggtggt 


180 


tcagtattcg 


atactgaatc 


actcgaactt 


actgtactag 


tgaccgatgc agcagctgtt 


240 


gaagctgttg 


aagccacagg 


tgcaggtaca 


gagctcgtat 


cttatggtat tgatggatta 


300 


gatgagatcg 


tacaagagct 


taatgcagct 


gatgccgttc 


caggtgtagt tggatggtat 


360 


cctgatgtag 


caggtgatac 


tgttgtctta 


gaagttcttg 


aaggctctgg agctgatgtt 


420 


tctggacttt 


tagcagacgc 


aggagtcgat 


gcatccgcgg 


ttgaagtgac cacgtcagat 


480 


cagcctgaac 


tctatgccga 


tatcattgga 


ggcctagcgt 


acacaatggg tggtcgctgc 


540 


agcgtaggat 


ttgcagccac 


aaatgcagct 


ggacaacctg 


gcttcgtgac agctggacat 


600 


tgcggccgcg 


tcggtacaca 


ggttactatc 


ggcaatggaa 


gaggtgtctt tgagcaaagc 


660 


gtatttcccg 


ggaatgatgc 


tgccttcgtt 


agaggtacgt 


ccaactttac gcttactaac 


720 


ttagtatcta 


gatacaacac 


tggcggatat 


gcaactgtag 


caggtcacaa tcaagcacct 


780 


attggctcta 


gcgtctgccg 


ctcagggtcg 


actacaggat 


ggcattgtgg aaccattcaa 


840 


gctagaggtc 


agagcgtgag 


ctatcctgaa 


ggtaccgtaa 


cgaacatgac tcgtacgact 


900 


gtatgtgcag 


aaccaggtga 


ctctggaggt 


tcatatatca 


gcggtacgca agcgcaaggc 


960 


gttacctcag 


gtggatccgg 


taactgtagg 


acaggtggca 


caacgttcta ccaggaagtg 


1020 


acaccgatgg 


tgaactcttg 


gggagttaga 


ctccgtacat 


aa 


1062 
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<211> 1143 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> "M^SSlS 18 R o£o n? C10Rsynt-15) encoding a S2A protease denoted 
10R fused by pcr in frame to the signal peptide encodi no 
sequence of a heterologous protease, savinase. encotnn 9 

<400> 2 

atgaagaaac cgttggggaa aattgtcgca agcaccgcac tactcatttc tgttgctttt 60 

agttcatcga tcgcatcggc tgctactgga gcattacctc agtctcctac acctgaagca 120 

gatgcagtat cgatgcaaga agcattacaa cgtgatcttg atcttacatc agctgaagct 180 

gaggaattac ttgctgcaca agatacagcc tttgaagttg atgaagctgc cgctgaagca 240 

gctggtgatg catatggtgg ttcagtattc gatactgaat cactcgaact tactgtacta 300 

gtgaccgatg cagcagctgt tgaagctgtt gaagccacag gtgcaggtac agagctcgta 360 

tcttatggta ttgatggatt agatgagatc gtacaagagc ttaatgcagc tgatgccgtt 420 

ccaggtgtag ttggatggta tcctgatgta gcaggtgata ctgttgtctt agaagttctt 480 

gaaggctctg gagctgatgt ttctggactt ttagcagacg caggagtcga tgcatccgcg 540 

gttgaagtga ccacgtcaga tcagcctgaa ctctatgccg atatcattgg aggcctagcg 600 

tacacaatgg gtggtcgctg cagcgtagga tttgcagcca caaatgcagc tggacaacct 660 

ggcttcgtga cagctggaca ttgcggccgc gtcggtacac aggttactat cggcaatgga 720 

agaggtgtct ttgagcaaag cgtatttccc gggaatgatg ctgccttcgt tagaggtacg 780 

tccaacttta cgcttactaa cttagtatct agatacaaca ctggcggata tgcaactgta 840 

gcaggtcaca atcaagcacc tattggctct agcgtctgcc gctcagggtc gactacagga 900 

tggcattgtg gaaccattca agctagaggt cagagcgtga gctatcctga aggtaccgta 960 

acgaacatga ctcgtacgac tgtatgtgca gaaccaggtg actctggagg ttcatatatc 1020 

agcggtacgc aagcgcaagg cgttacctca ggtggatccg gtaactgtag gacaggtggc 1080 

acaacgttct accaggaagt gacaccgatg gtgaactctt ggggagttag actccgtaca 1140 
taa 

1143 

<210> 3 
<211> 8 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> inieSiSS] amin ° aCid tai1 ex P ressed ^ fusion to protease of the 
<400> 3 

Gin ser His val Gin ser Ala Pro 
1 5 

<210> 4 
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<211> 24 10495.204-WO.ST25.txt 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> PoJVnucleotide encoding a c-terminal amino acid tail expressed as 
msion to protease of the invention. 

<400> 4 

caatcgcatg ttcaatccgc tcca 24 

<210> 5 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Won? 1 " 31 amino acid tai1 expressed as fusion to protease of the 

<400> 5 

Gin ser Ala Pro 
1 

<210> 6 
<211> 12 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polynucleotide encoding a c-terminal amino acid tail expressed as 
fusion to protease of the invention. expressea as 

<400> 6 

caatcggctc ct 12 

<210> 7 

<211> 2 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> invention 1 3Cid tai1 expressed as fusion to protease of the 

<400> 7 

Gin Pro 
1 

<210> 8 
<211> 6 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Polynucleotide encoding a c-terminal amino acid tail expressed as 
fusion to protease of the invention. expressea as 

<400> 8 
caacca 
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<210> 



9 
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<211> 1 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> invenSon" 1 ami "° aCid tail ex P ressed as fusion to protease of the 

<400> 9 

Pro 
1 



<210> 10 

<211> 3 

<212> DNA 

<213> Artificial sequence 
<220> 



<400> 10 
cca 



<210> 11 
<211> 45 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #252639 

<400> 11 

catgtgcatg tgggtaccgc aacgttcgca gatgctgctg aagag 

<210> 12 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #251992 

<400> 12 

catgtgcatg tggtcgaccg attatggagc ggattgaaca tgcg 

<210> 13 

<211> 44 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> primer #179541 

<400> 13 

gcgttgagac gcgcggccgc gagcgccgtt tggctgaatg atac 

<210> 14 

<211> 43 

<212> DNA 

<213> Artificial sequence 



<223> 




1 amino acid tail expressed as 
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<220> 

<223> Primer #179542 
<400> 14 

gcgttgagac agctcgagca gggaaaaatg gaaccgcttt ttc 43 

<210> 15 

<211> 64 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179539 

<400> 15 

ccatttgatc agaattcact ggccgtcgtt ttacaaccat tgcggaaaat agtcataggc 60 

atcc CA 

64 

<210> 16 
<211> 60 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179540 
<400> 16 

ggatccagat ctggtacccg ggtctagagt cgacgcggcg gttcgcgtcc ggacagcaca 60 

<210> 17 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179154 

<400> 17 

gttgtaaaac gacggccagt gaattctgat caaatgg 37 

<210> 18 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #179153 

<400> 18 

ccgcgtcgac actagacacg ggtacctgat ctagatc 37 

<210> 19 
<211> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #317 
<400> 19 

tggcgcaatc ggtaccatgg gg 22 
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<210> 20 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer #139 NotI 
<400> 20 

catgtgcatg cggccgcatt aacgcgttgc cgcttctgcg 40 

<210> 21 
<211> 7443 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> sequence of plasmid pMBl508 
<400> 21 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 

accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 

attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 

tacgccagct ggcgaaaggg ggatgtgctg caaggcgatt aagttgggta acgccagggt 360 

tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cgataaaagt gctttttttg 420 

ttgcaattga agaattatta atgttaagct taattaaaga taatatcttt gaattgtaac 480 

gcccctcaaa agtaagaact acaaaaaaag aatacgttat atagaaatat gtttgaacct 540 

tcttcagatt acaaatatat tcggacggac tctacctcaa atgcttatct aactatagaa 600 

tgacatacaa gcacaacctt gaaaatttga aaatataact accaatgaac ttgttcatgt 660 

gaattatcgc tgtatttaat tttctcaatt caatatataa tatgccaata cattgttaca 720 

agtagaaatt aagacaccct tgatagcctt actataccta acatgatgta gtattaaatg 780 

aatatgtaaa tatatttatg ataagaagcg acttatttat aatcattaca tatttttcta 840 

ttggaatgat taagattcca atagaatagt gtataaatta tttatcttga aaggagggat 900 

gcctaaaaac gaagaacatt aaaaacatat atttgcaccg tctaatggat ttatgaaaaa 960 

tcattttatc agtttgaaaa ttatgtatta tggagctctg aaaaaaagga gaggataaag 1020 

aatgaagaaa ccgttgggga aaattgtcgc aagcaccgca ctactcattt ctgttgcttt 1080 

tagttcatcg atcgcatcgg ctgctgaaga agcaaaagaa aaatatttaa ttggctttaa 1140 

tgagcaggaa gctgtcagtg agtttgtaga acaagtagag gcaaatgacg aggtcgccat 1200 

tctctctgag gaagaggaag tcgaaattga attgcttcat gaatttgaaa cgattcctgt 1260 

tttatccgtt gagttaagcc cagaagatgt ggacgcgctt gaactcgatc cagcgatttc 1320 

ttatattgaa gaggatgcag aagtaacgac aatggcgcaa tcggtaccat ggggtatatc 1380 
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aacgcgttaa tccgcggata tatagcggcc gcagatctgg gaccaataat aatgactaga 1440 

gaagaaagaa tgaagattgt tcatgaaatt aaggaacgaa tattggataa agtgggatat 1500 

ttttaaaata tatatttatg ttacagtaat attgactttt aaaaaaggat tgattctaat 1560 

gaagaaagca gacaagtaag cctcctaaat tcactttaga taaaaattta ggaggcatat 1620 

caaatgaact ttaataaaat tgatttagac aattggaaga gaaaagagat atttaatcat 1680 

tatttgaacc aacaaacgac ttttagtata accacagaaa ttgatattag tgttttatac 1740 

cgaaacataa aacaagaagg atataaattt taccctgcat ttattttctt agtgacaagg 1800 

gtgataaact caaatacagc ttttagaact ggttacaata gcgacggaga gttaggttat 1860 

tgggataagt tagagccact ttatacaatt tttgatggtg tatctaaaac attctctggt 1920 

atttggactc ctgtaaagaa tgacttcaaa gagttttatg atttatacct ttctgatgta 1980 

gagaaatata atggttcggg gaaattgttt cccaaaacac ctatacctga aaatgctttt 2040 

tctctttcta ttattccatg gacttcattt actgggttta acttaaatat caataataat 2100 

agtaattacc ttctacccat tattacagca ggaaaattca ttaataaagg taattcaata 2160 

tatttaccgc tatctttaca ggtacatcat tctgtttgtg atggttatca tgcaggattg 2220 

tttatgaact ctattcagga attgtcagat aggcctaatg actggctttt ataatatgag 2280 

ataatgccga ctgtactttt tacagtcggt tttctaacga tacattaata ggtacgaaaa 2340 

agcaactttt tttgcgqtta aaaccagtca taccaataac ttaagggtaa ctagcctcgc 2400 

cggaaagagc gaaaatgcct cacatttgtg ccacctaaaa aggagcgatt tacatatgag 2460 

ttatgcagtt tgtagaatgc aaaaagtgaa atcagctgga ctaaaagggg ccgcagagta 2520 

gaatggaaaa ggggatcgga aaacaagtat ataggaggag acctatttat ggcttcagaa 2580 

aaagacgcag gaaaacagtc agcagtaaag cttgttccat tgcttattac tgtcgctgtg 2640 

ggactaatca tctggtttat tcccgctccg tccggacttg aacctaaagc ttggcatttg 2700 

tttgcgattt ttgtcgcaac aattatcggc tttatctcca agcccttgcc aatgggtgca 2760 

attgcaattt ttgcattggc ggttactgca ctaactggaa cactatcaat tgaggataca 2820 

ttaagcggat tcgggaataa gaccatttgg cttatcgtta tcgcattctt tatttcccgg 2880 

ggatttatca aaaccggtct cggtgcgaga atttcgtatg tattcgttca gaaattcgga 2940 

aaaaaaaccc ttggactttc ttattcactg ctattcagtg atttaatact ttcacctgct 3000 

attccaagta atacggcgcg tgcaggaggc attatatttc ctattatcag atcattatcc 3060 

gaaacattcg gatcaagccc ggcaaatgga acagagagaa aaatcggtgc attcttatta 3120 

aaaaccggtt ttcaggggaa tctgatcaca tctgctatgt tcctgacagc gatggcggcg 3180 

aacccgctga ttgccaagct ggcccatgat gtcgcagggg tggacttaac atggacaagc 3240 

tgggcaattg ccgcgattgt accgggactt gtaagcttaa tcatcacgcc gcttgtgatt 3300 

tacaaactgt atccgccgga aatcaaagaa acaccggatg cggcgaaaat cgcaacagaa 3360 

aaactgaaag aaatgggacc gttcaaaaaa tcggagcttt ccatggttat cgtgtttctt 3420 
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ttggtgcttg tgctgtggat ttttggcggc agcttcaaca tcgacgctac 
ttgatcggtt tggccgttct cttattatca caagttctga cttgggatga 
gaacagggcg cttgggatac gctcacttgg tttgcggcgc ttgtcatgct 
ttgaatgaat taggcatggt gtcttggttc agtaatgcca tgaaatcatc 
ttctcttgga ttgtggcatt catcatttta attgttgtgt attattactc 
tttgcaagtg cgacagccca catcagtgcg atgtattcag catttttggc 
gcagcgggcg caccgccgct tttagcagcg ctgagcctcg cgttcatcag 
gggtcaacga ctcactacgg ttctggagcg gctccggtct tcttcggagc 
ccgcaaggca aatggtggtc catcggattt atcctgtcga ttgttcatat 
cttgtgatcg gcggattatg gtggaaagta ctaggaatat ggtagaaaga 
cgcggtctgc ctttttttat tttcactcct tcgtaagaaa atggattttg 
aaattccctg tgaaaaatgg tatgatctag gtagaaagga cggctggtgc 
aagcggttcc atttttccct gcaaacaaaa ataatggggc tgattgcggc 
tttgtcattg gtgtgctgac cattacgtta gccgttcagc atacacaggg 
caggcagagc agctggcggt tcaaacggcg agaaccattt cctatatgcc 
gagctcattg agagaaaaga cggacatgcg gctcagacgc aagaggtcat 
aaagaacaga ctggtgcgtt tgccatttat gttttgaacg aaaaaggaga 
gcctctggaa aaagcggatt aaagaaactg gagcgcagca gagaaatttt 
tcgcatgttt ctgaaacaaa agcggatgga cgaagagtga tcagagggag 
ataaaagaac agaagggata cagccaagtg atcggcagcg tgtctgttga 
acggagacag agcaaagcat caaaaagcat ttgagaaatt tgagtgtgat 
gtactgctgc tcggatttat tggcgccgcc gtgctggcga aaagcatcag 
ctcgggcttg aaccgcatga gatcgcggct ctatatcgtg agaggaacgc 
gcgattcgag aagggattat tgccaccaat cgtgaaggcg tcgtcaccat 
tcggcggccg agatgctgaa gctgcccgag cctgtgatcc atcttcctat 
atgccgggag cagggctgat gtctgtgctt gaaaaaggag aaatgctgcc 
gtaagcgtca acgatcaagt gtttattatc aatacgaaag tgatgaatca 
gcgtatggga ttgtcgtcag cttcagggag aaaacagagc tgaagaagct 
ttgacagagg ttcgcaaata ttcagaggat ctcagggcgc agactcatga 
aagctttatg cgattttagg gctgcgtcga cctgcaggca tgcaagcttg 
ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac 
ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc 
cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg 



tcggccaacg cgcggggaga ggcggtttgc gtattgggcg ctcttccgct 
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cacaaccgca 
tatcaagaaa 
cgccaacttc 
cgtatcaggg 
tcactatttc 
tgtcgtcgtg 
caacctgttc 
aggctacatc 
catcgtatgg 
aaaaggcaga 
aaaaatgaga 
tgtggtgaaa 
tctgctggtc 
agaacggaga 
gccggttaaa 
tgaacaaatg 
cattcgcagc 
gtttggcggt 
cgcgccgatt 
ttttctgcaa 
tgctgtgctt 
aaaggatacg 
aatgcttttc 
gatgaacgta 
agatgacgtc 
gaaccaggaa 
aggcgggcag 
gatcgacaca 
attttcaaat 
gcgtaatcat 
aacatacgag 
acattaattg 
cattaatgaa 
tcctcgctca 



3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 
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ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 5520 
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 5580 
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 5640 
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 5700 
tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 5760 
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 5820 
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 5880 
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 5940 
acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 6000 
cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 6060 
gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 6120 
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 6180 
agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 6240 
ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 6300 

ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 6360 

atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 6420 

tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac 6480 

gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg 6540 

caccggattt atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg 6600 

caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt 6660 

cgccagttaa tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct 6720 

cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat 6780 

cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta 6840 

agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca 6900 

tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat 6960 

agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac 7020 

atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa 7080 

ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt 7140 

cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg 7200 

caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat 7260 

attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt 7320 

agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct 7380 

aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc 7440 
gtc 

7443 
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<210> 22 
<211> 5718 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> sequence of MB1510 genomic integration region 
<400> 22 

gagcgccgtt tggctgaatg atacaacagt ctcacttcct tactgcgtct ggttgcaaaa 60 

acgaagaagc aaggattccc ctcgcttctc atttgtccta tttattatac acttttttaa 120 

gcacatcttt ggcgcttgtt tcactagact tgatgcctct gaatcttgtc caagtgtcac 180 

ggtccgcatc atagacttgt ccatttttca ccgctttgag atttttccag agcgggttcg 240 

ttttccactc atctacaatg gttttgcctt cgttggctga gatgaacaaa atatcaggat 300 

cgattttgct caattgctca aggctgacct cttgataggc gttatctgac ttcacagcgt 360 

gtgtaaagcc tagcatttta aagatttctc cgtcatagga tgatgatgta tgaagctgga 420 

aggaatccgc tcttgcaacg ccgagaacga tgttgcggtt ttcatctttc ggaagttcgg 480 

cttttagatc gttgatgact tttttgtgct cggcaagctt ttcttttcct tcatcttctt 540 

tatttaatgc tttagcaatg gtcgtaaagc tgtcgatcgt ttcgtcatat gtcgcttcac 

ggctttttaa ttcaatcgtc ggggcgattt ttttcagctg tttataaatg tttttatggc 

gctcagcgtc agcgatgatt aaatcaggct tcaaggaact gatgacctca agattgggtt 720 

cgctgcgtgt gcctacagat gtgtaatcaa tggagctgcc gacaagcttt ttaatcatat 780 

cttttttgtt gtcatctgcg atgcccaccg gcgtaatgcc gagattgtga acggcatcca 

agaatgaaag ctcaagcaca accacccgct taggtgtgcc gcttactgtc gtttttcctt 

cttcgtcatg gatcactctg gaatccttag actcgctttt gccgcttccg ttgttattct 

ggcttgatga acagccggat acaatgaggc aggcgagcaa taaaacactc atgatggcaa 1020 

tcaacttgtt agaataggtg cgcatgtcat tcttcctttt ttcagattta gtaatgagaa 1080 

tcattatcac atgtaacact ataatagcat ggcttatcat gtcaatattt ttttagtaaa 1140 

gaaagctgcg tttttactgc tttctcatga aagcatcatc agacacaaat aagtggtatg 1200 

cagcgttacc gtgtcttcga gacaaaaacg catgggcgtt ggctttagag gtttcgaaca 1260 

tatcagcagt gacataagga aggagagtgc tgagataacc ggacaatttc ttttctattt 1320 

catctgttag tgcaaattca atgtcgccga tattcatgat aatcgagaaa acaaagtcga 1380 

tatcgatatg aaaatgttcc tcggcaaaaa ccgcaagctc gtgaattcct ggtgaacatc 1440 

cggcacgctt atggaaaatc tgtttgacta aatcactcac aatccaagca ttgtattgct 1500 

gttctggtga aaagtattgc attagacata cctcctgctc gtacggataa aggcagcgtt 1560 

tcatggtcgt gtgctccgtg cagcggcttc tccttaattt tgatttttct gaaaataggt 1620 

cccgttccta tcactttacc atggacggaa aacaaatagc tactaccatt cctcctgttt 1680 

ttctcttcaa tgttctggaa tctgtttcag gtacagacga tcgggtatga aagaaatata 1740 
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gaaaacatga aggaggaata tcgacatgaa accagttgta aaagagtata caaatgacga 1800 

acagctcatg aaagatgtag aggaattgca gaaaatgggt gttgcgaaag aggatgtata 1860 

cgtcttagct cacgacgatg acagaacgga acgcctggct gacaacacga acgccaacac 1920 

gatcggagcc aaagaaacag gtttcaagca cgcggtggga aatatcttca ataaaaaagg 1980 

agacgagctc cgcaataaaa ttcacgaaat cggtttttct gaagatgaag ccgctcaatt 2040 

tgaaaaacgc ttagatgaag gaaaagtgct tctctttgtg acagataacg aaaaagtgaa 2100 

agcttgggca taaagcaagg aaaaaaccaa aaggccaatg tcggcctttt ggtttttttg 2160 

cggtctttgc ggtgggattt tgcagaatgc cgcaatagga tagcggaaca ttttcggttc 2220 

tgaatgtccc tcaatttgct attatatttt tgtgataaat tggaataaaa tctcacaaaa 2280 

tagaaaatgg gggtacatag tggatgaaaa aagtgatgtt agctacggct ttgtttttag 2340 

gattgactcc agctggcgcg aacgcagctg atttaggcca ccagacgttg ggatccaatg 2400 

atggctgggg cgcgtactcg accggcacga caggcggatc aaaagcatcc tcctcaaatg 2460 

tgtataccgt cagcaacaga aaccagcttg tctcggcatt agggaaggaa acgaacacaa 2520 

cgccaaaaat catttatatc aagggaacga ttgacatgaa cgtggatgac aatctgaagc 2580 

cgcttggcct aaatgactat aaagatccgg agtatgattt ggacaaatat ttgaaagcct 2640 

atgatcctag cacatggggc aaaaaagagc cgtcgggaac acaagaagaa gcgagagcac 2700 

gctctcagaa aaaccaaaaa gcacgggtca tggtggatat ccctgcaaac acgacgatcg 2760 

tcggttcagg gactaacgct aaagtcgtgg gaggaaactt ccaaatcaag agtgataacg 2820 

tcattattcg caacattgaa ttccaggatg cctatgacta ttttccgcaa tggttgtaaa 2880 

acgacggcca gtgaattctg atcaaatggt tcagtgagag cgaagcgaac acttgatttt 2940 

ttaattttct atcttttata ggtcattaga gtatacttat ttgtcctata aactatttag 3000 

cagcataata gatttattga ataggtcatt taagttgagc atattagagg aggaaaatct 3060 

tggagaaata tttgaagaac ccgagatcta gatcaggtac cgcaacgttc gcagatgctg 3120 

ctgaagagat tattaaaaag ctgaaagcaa aaggctatca attggtaact gtatctcagc 3180 

ttgaagaagt gaagaagcag agaggctatt gaataaatga gtagaaagcg ccatatcggc 3240 

gcttttcttt tggaagaaaa tatagggaaa atggtacttg ttaaaaattc ggaatattta 3300 

tacaatatca tatgtatcac attgaaagga ggggcctgct gtccagactg tccgctgtgt 3360 

aaaaataagg aataaagggg ggttgacatt attttactga tatgtataat ataatttgta 3420 

taagaaaatg gaggggccct cgaaacgtaa gatgaaacct tagataaaag tgcttttttt 3480 

gttgcaattg aagaattatt aatgttaagc ttaattaaag ataatatctt tgaattgtaa 3540 

cgcccctcaa aagtaagaac tacaaaaaaa gaatacgtta tatagaaata tgtttgaacc 3600 

ttcttcagat tacaaatata ttcggacgga ctctacctca aatgcttatc taactataga 3660 

atgacataca agcacaacct tgaaaatttg aaaatataac taccaatgaa cttgttcatg 3720 

tgaattatcg ctgtatttaa ttttctcaat tcaatatata atatgccaat acattgttac 3780 
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aagtagaaat taagacaccc ttgatagcct tactatacct aacatgatgt agtattaaat 3840 

gaatatgtaa atatatttat gataagaagc gacttattta taatcattac atatttttct 3900 

attggaatga ttaagattcc aatagaatag tgtataaatt atttatcttg aaaggaggga 3960 

tgcctaaaaa cgaagaacat taaaaacata tatttgcacc gtctaatgga tttatgaaaa 4020 

atcattttat cagtttgaaa attatgtatt atggagctct gaaaaaaagg agaggataaa 4080 

gagaaaaggg gatcggaaaa caagtatata ggaggagacc tatttatggc ttcagaaaaa 4140 

gacgcaggaa aacagtcagc agtaaagctt gttccattgc ttattactgt cgctgtggga 4200 

ctaatcatct ggtttattcc cgctccgtcc ggacttgaac ctaaagcttg gcatttgttt 4260 

gcgatttttg tcgcaacaat tatcggcttt atctccaagc ccttgccaat gggtgcaatt 4320 

gcaatttttg cattggcggt tactgcacta actggaacac tatcaattga ggatacatta 4380 

agcggattcg ggaataagac catttggctt atcgttatcg cattctttat ttcccgggga 4440 

tttatcaaaa ccggtctcgg tgcgagaatt tcgtatgtat tcgttcagaa attcggaaaa 4500 

aaaacccttg gactttctta ttcactgcta ttcagtgatt taatactttc acctgctatt 4560 

ccaagtaata cggcgcgtgc aggaggcatt atatttccta ttatcagatc attatccgaa 4620 

acattcggat caagcccggc aaatggaaca gagagaaaaa tcggtgcatt cttattaaaa 4680 

accggttttc aggggaatct gatcacatct gctatgttcc tgacagcgat ggcggcgaac 4740 

ccgctgattg ccaagctggc ccatgatgtc gcaggggtgg acttaacatg gacaagctgg 4800 

gcaattgccg cgattgtacc gggacttgta agcttaatca tcacgccgct tgtgatttac 4860 
aaactgtatc cgccggaaat caaagaaaca ccggatgcgg cgaaaatcgc aacagaaaaa . 4920 

ctgaaagaaa tgggaccgtt caaaaaatcg gagctttcca tggttatcgt gtttcttttg 4980 

gtgcttgtgc tgtggatttt tggcggcagc ttcaacatcg acgctaccac aaccgcattg 5040 

atcggtttgg ccgttctctt attatcacaa gttctgactt gggatgatat caagaaagaa 5100 

cagggcgctt gggatacgct cacttggttt gcggcgcttg tcatgctcgc caacttcttg 5160 

aatgaattag gcatggtgtc ttggttcagt aatgccatga aatcatccgt atcagggttc 5220 

tcttggattg tggcattcat cattttaatt gttgtgtatt attactctca ctatttcttt 5280 

gcaagtgcga cagcccacat cagtgcgatg tattcagcat ttttggctgt cgtcgtggca 5340 

gcgggcgcac cgccgctttt agcagcgctg agcctcgcgt tcatcagcaa cctgttcggg 5400 

tcaacgactc actacggttc tggagcggct ccggtcttct tcggagcagg ctacatcccg 5460 

caaggcaaat ggtggtccat cggatttatc ctgtcgattg ttcatatcat cgtatggctt 5520 

gtgatcggcg gattatggtg gaaagtacta ggaatatggt agaaagaaaa aggcagacgc 5580 

ggtctgcctt tttttatttt cactccttcg taagaaaatg gattttgaaa aatgagaaaa 5640 

ttccctgtga aaaatggtat gatctaggta gaaaggacgg ctggtgctgt ggtgaaaaag 5700 

cggttccatt tttccctg ___ 0 
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<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1605 
<400> 23 

gacggccagt gaattcgata aaagtgc 27 

<210> 24 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1606 
<220> 

<221> misc_feature 

<222> (13) . . (13) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (16) . . (16) 

<223> n is a, c, g, or t 

<400> 24 

ccagatctct atnktnktgt acggagtcta actccccaag ag 42 

<210> 25 
<211> 1112 
<212> DNA 

<213> Nocardiopsis dassonvillei dsm 43235 
<400> 25 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccagac ccccgtcgcc 
gacgacagcg ccgccagcat gaccgaggcg ctcaagcgcg acctcgacct cacctcggcc 
gaggccgagg agcttctctc ggcgcaggaa gccgccatcg agaccgacgc cgaggccacc 
gaggccgcgg gcgaggccta cggcggctca ctgttcgaca ccgagaccct cgaactcacc 240 
gtgctggtca ccgacgcctc cgccgtcgag gcggtcgagg ccaccggagc ccaggccacc 300 
gtcgtctccc acggcaccga gggcctgacc gaggtcgtgg aggacctcaa cggcgccgag 
gttcccgaga gcgtcctcgg ctggtacccg gacgtggaga gcgacaccgt cgtggtcgag 
gtgctggagg gctccgacgc cgacgtcgcc gccctgctcg ccgacgccgg tgtggactcc 
tcctcggtcc gggtggagga ggccgaggag gccccgcagg tctacgccga catcatcggc 
ggcctggcct actacatggg cggccgctgc tccgtcggct tcgccgcgac caacagcgcc 
ggtcagcccg gtttcgtcac cgccggccac tgcggcaccg tcggcaccgg cgtgaccatc 
ggcaacggca ccggcacctt ccagaactcg gtcttccccg gcaacgacgc cgccttcgtc 
cgcggcacct ccaacttcac cctgaccaac ctggtctcgc gctacaactc cggcggctac 
cagtcggtga ccggtaccag ccaggccccg gccggctcgg ccgtgtgccg ctccggctcc 840 
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accaccggct ggcactgcgg caccatccag gcccgcaacc agaccgtgcg ctacccgcag 900 

ggcaccgtct actcgctcac ccgcaccaac gtgtgcgccg agcccggcga ctccggcggt 960 

tcgttcatct ccggctcgca ggcccagggc gtcacctccg gcggctccgg caactgctcc 1020 

gtcggcggca cgacctacta ccaggaggtc accccgatga tcaactcctg gggtgtcagg 1080 

atccggacct aatcgcatgt tcaatccgct cc 1112 

<210> 26 

<211> 48 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1423 

<400> 26 

gcttttagtt catcgatcgc atcggctgct ccggcccccg tcccccag 48 

<210> 27 

<211> 45 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1475 

<400> 27 

ggagcggatt gaacatgcga ttaggtccgg atcctgacac cccag 45 

<210> 28 

<211> 354 

<212> prt 

<213> Nocardiopsis dassonvillei DSM 43235 
<220> 

<221> propep 

<222> (1) . . (166) 

<220> 

<221> mat_peptide 
<222> C1675..C354) 

<400> 28 

Ala pro Ala Pro val Pro Gin Thr pro Val Ala Asp Asp ser Ala 
-lo5 -160 -155 

Ala S ?cn Met Thr Glu Ala Leu L V S Ar 9 Asp Leu Asp Leu Thr ser 
_15 0 -145 _ 140 

Ala Glu Ala Glu Glu Leu Leu ser Ala Gin Glu Ala Ala He Glu 
-l" -130 -125 

Thr Asp Ala Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly 
-120 -115 _ 110 j j 
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Ser Leu Phe Asp Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr asd 
-105 -100 -95 

Ala ser Ala Val Glu Ala val Glu Ala Thr Gly Ala Gin Ala Thr val 
~ yu -85 -80 -75 

Val ser His Gly Thr Glu Gly Leu Thr Glu val Val Glu Asp Leu Asn 
-70 -65 -60 

Gly Ala Glu Val Pro Glu ser val Leu Gly Trp Tyr Pro Asp val Glu 
-55 -50 -45 

Ser Asp Thr Val Val val Glu val Leu Glu Gly ser Asp Ala Asd val 
-40 -35 -30 

Ala Ala Leu Leu Ala Asp Ala Gly val Asp ser ser ser val Arg val 
25 -20 -15 

Glu Glu Ala Glu Glu Ala Pro Gin val Tyr Ala Asp He lie Gly Gly 
-xv -5 -11 5 

Leu Ala Tyr Tyr Met Gly Gly Arg Cys Ser Val Gly Phe Ala Ala Thr 
10 15 20 

Asn ser Ala Gly Gin Pro Gly Phe val Thr Ala Gly His cys Gly Thr 
25 30 35 

Val Gly Thr Gly val Thr lie Gly Asn Gly Thr Gly Thr Phe Gin Asn 
w 45 5q 

Ser val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn 
" 60 65 70 

Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Ser Gly Gly Tyr Gin 
75 80 85 

Ser val Thr Gly Thr Ser Gin Ala Pro Ala Gly ser Ala val Cys Arq 
90 95 100 

ser Gly Ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Asn 
105 110 115 

Gin Thr val Arg Tyr Pro Gin Gly Thr Val Tyr ser Leu Thr Arg Thr 

Asn val cys Ala Glu Pro Gly Asp ser Gly Gly ser Phe lie ser Gly 
135 140 145 150 

Ser Gin Ala Gin Gly val Thr ser Gly Gly Ser Gly Asn cys ser Val 
3-55 160 165 
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Gly Gly Thr Thr Tyr Tyr Gin G"h? vll" Thr" 1?ro Met' I* e Asn Ser Trp 
170 175 180 



Gly val Arg lie Arg Thr 
185 



60 



<210> 29 
<211> 498 
<212> DNA 

<213> Nocardiopsis dassonvillei dsm 43235 
<400> 29 

gctccggccc ccgtccccca gacccccgtc gccgacgaca gcgccgccag catgaccgag 
gcgctcaagc gcgacctcga cctcacctcg gccgaggccg aggagcttct ctcggcgcag 120 
gaagccgcca tcgagaccga cgccgaggcc accgaggccg cgggcgaggc ctacggcggc 180 
tcactgttcg acaccgagac cctcgaactc accgtgctgg tcaccgacgc ctccgccgtc 240 
gaggcggtcg aggccaccgg agcccaggcc accgtcgtct cccacggcac cgagggcctg 300 
accgaggtcg tggaggacct caacggcgcc gaggttcccg agagcgtcct cggctggtac 360 
ccggacgtgg agagcgacac cgtcgtggtc gaggtgctgg agggctccga cgccgacgtc 420 
gccgccctgc tcgccgacgc cggtgtggac tcctcctcgg tccgggtgga ggaggccgag 480 
gaggccccgc aggtctac 4g8 

<210> 30 

<211> 166 

<212> PRT 

<213> Nocardiopsis dassonvillei DSM 43235 

<400> 30 

Ala Pro Ala pro val pro Gin Thr pro val Ala Asp Asp ser Ala Ala 
X 5 10 15 

Ser Met Thr Glu Ala Leu Lys Arg Asp Leu Asp Leu Thr ser Ala Glu 
zu 25 30 

Ala Glu Glu Leu Leu Ser Ala Gin Glu Ala Ala He Glu Thr Asp Ala 
da 40 45 

Glu Ala Thr Glu Ala Ala Gly Glu Ala Tyr Gly Gly ser Leu Phe Asp 
■>« 55 60 

Thr Glu Thr Leu Glu Leu Thr val Leu Val Thr Asp Ala Ser Ala val 

70 75 80 

Glu Ala Val Glu Ala Thr Gly Ala Gin Ala Thr Val Val ser His Gly 
85 90 95 3 

Thr Glu Gly Leu Thr Glu Val val Glu Asp Leu Asn Gly Ala Glu Val 

-LUU 105 HQ 
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Pro G!u sen va! Leu Gly rrp TV^ ^pTai^^ Asp nr val 
■i.J.3 120 125 

Val yal Glu val Leu Glu Gly sen Asp Ala Asp Val Ala Ala Leu Leu 
XDXJ 135 140 

Ala Asp Ala Gly Val Asp ser ser ser val Arg Val Glu Glu Ala Glu 
xn 150 155 160 

Glu Ala Pro Gin Val Tyr 
165 

<210> 31 
<211> 1146 
<212> DMA 

<213> Artificial sequence 
<220> 

<223> The dna sequence coding for the pro- region of SEQ id no- 29 fused 
<400> 31 

atgaagaaac cgttggggaa aattgtcgca agcaccgcac tactcatttc tgttgctttt 60 

agttcatcga tcgcatcggc tgctccggcc cccgtccccc agacccccgt cgccgacgac 120 

agcgccgcca gcatgaccga ggcgctcaag cgcgacctcg acctcacctc ggccgaggcc 180 

gaggagcttc tctcggcgca ggaagccgcc atcgagaccg acgccgaggc caccgaggcc 240 

gcgggcgagg cctacggcgg ctcactgttc gacaccgaga ccctcgaact caccgtgctg 300 

gtcaccgacg cctccgccgt cgaggcggtc gaggccaccg gagcccaggc caccgtcgtc 360 

tcccacggca ccgagggcct gaccgaggtc gtggaggacc tcaacggcgc cgaggttccc 420 

gagagcgtcc tcggctggta cccggacgtg gagagcgaca ccgtcgtggt cgaggtgctg 480 

gagggctccg acgccgacgt cgccgccctg ctcgccgacg ccggtgtgga ctcctcctcg 540 

gtccgggtgg aggaggccga ggaggccccg caggtctatg ccgatatcat tggaggccta 600 

gcgtacacaa tgggtggtcg ctgcagcgta ggatttgcag ccacaaatgc agctggacaa 660 

cctggcttcg tgacagctgg acattgcggc cgcgtcggta cacaggttac tatcggcaat 720 

ggaagaggtg tctttgagca aagcgtattt cccgggaatg atgctgcctt cgttagaggt 780 

acgtccaact ttacgcttac taacttagta tctagataca acactggcgg atatgcaact 840 

gtagcaggtc acaatcaagc acctattggc tctagcgtct gccgctcagg gtcgactaca 900 

ggatggcatt gtggaaccat tcaagctaga ggtcagagcg tgagctatcc tgaaggtacc 960 

gtaacgaaca tgactcgtac gactgtatgt gcagaaccag gtgactctgg aggttcatat 1020 

atcagcggta cgcaagcgca aggcgttacc tcaggtggat ccggtaactg taggacaggt 1080 

ggcacaacgt tctaccagga agtgacaccg atggtgaact cttggggagt tagactccgt 1140 

3Cataa 1146 
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<211> 1068 

<212> DNA 

<213> Nocardiopsis Alba DSM 15647 

<400> 32 



gcgaccggcc 


ccctccccca 


gtcccccacc 


ccggatgaag 


ccgaggccac caccatggtc 


60 


gaggccctcc 


agcgcgacct 


cggcctgtcc 


ccctctcagg 


ccgacgagct cctcgaggcg 


120 


caggccgagt 


ccttcgagat 


cgacgaggcc 


gccaccgcgg 


ccgcagccga ctcctacggc 


180 


ggctccatct 


tcgacaccga cagcctcacc 


ctgaccgtcc 


tggtcaccga cgcctccgcc 


240 


gtcgaggcgg 


tcgaggccgc 


cggcgccgag 


gccaaggtgg 


tctcgcacgg catggagggc 


300 


ctggaggaga 


tcgtcgccga 


cctgaacgcg 


gccgacgctc 


agcccggcgt cgtgggctgg 


360 


taccccgaca 


tccactccga 


cacggtcgtc 


ctcgaggtcc 


tcgagggctc cggtgccgac 


420 


gtggactccc 


tgctcgccga 


cgccggtgtg 


gacaccgccg 


acgtcaaggt ggagagcacc 


480 


accgagcagc 


ccgagctgta 


cgccgacatc 


atcggcggtc 


tcgcctacac catgggtggg 


540 


cgctgctcgg 


tcggcttcgc 


ggccaccaac 


gcctccggcc 


agcccgggtt cgtcaccgcc 


600 


ggccactgcg 


gcaccgtcgg 


caccccggtc 


agcatcggca 


acggccaggg cgtcttcgag 


660 


cgttccgtct 


tccccggcaa 


cgactccgcc 


ttcgtccgcg 


gcacctcgaa cttcaccctg 


720 


accaacctgg 


tcagccgcta caacaccggt 


ggttacgcga 


ccgtctccgg ctcctcgcag 


780 


gcggcgatcg 


gctcgcagat 


ctgccgttcc 


ggctccacca 


ccggctggca ctgcggcacc 


840 


gtccaggccc 


gcggccagac 


ggtgagctac 


ccccagggca 


ccgtgcagaa cctgacccgc 


900 


accaacgtct 


gcgccgagcc 


cggtgactcc 


ggcggctcct 


tcatctccgg cagccaggcc 


960 


cagggcgtca 


cctccggtgg 


ctccggcaac 


tgctccttcg 


gtggcaccac ctactaccag 


1020 


gaggtcaacc 


cgatgctgag 


cagctggggt 


ctgaccctgc 


gcacctga 


1068 



<210> 33 

<211> 355 

<212> PRT 

<213> Nocardiopsis Alba dsm 15647 
<220> 

<221> PROPEP 

<222> (1) . . (167) 

<220> 

<221> mat_peptide 

<222> (168)..C355) 

<400> 33 



Ala Thr Gly Pro Leu Pro Gin ser Pro Thr Pro Asp Glu Ala Glu 
-lo5 -160 _155 



Ala Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu ser 

-145 -140 

Pro ser Gin Ala Asp Glu Leu Leu Glu Ala Gin Ala Glu ser Phe 
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-135 -130 _125 

Glu He Asp Glu Ala Ala Thr Ala Ala Ala Ala Asp ser Tyr Gly 
-120 -115 _ 110 

Gly Ser lie Phe Asp Thr Asp ser Leu Thr Leu Thr val Leu Val Thr 
-xui -100 -95 

Asp Ala ser Ala val Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys 
" yu -85 -80 

Val val ser His Gly Met Glu Gly Leu Glu Glu He val Ala Asp Leu 
/:> " 70 -65 -60 

Asn Ala Ala Asp Ala Gin Pro Gly Val Val Gly Trp Tyr pro Asp lie 
-55 -50 -45 

His ser Asp Thr val val Leu Glu Val Leu Glu Gly ser Gly Ala asd 
" 40 -35 -30 

val Asp ser Leu Leu Ala Asp Ala Gly Val Asp Thr Ala Asp val Lys 
~" -20 -15 

val Glu ser Thr Thr Glu Gin pro Glu Leu Tyr Ala Asp He He Gly 
— -5 -11 5 

Gly Leu Ala Tyr Thr Met Gly Gly Arg cys Ser Val Gly phe Ala Ala 
10 15 20 

Thr Asn Ala ser Gly Gin Pro Gly Phe val Thr Ala Gly His cys Gly 
« 30 35 

Thr val Gly Thr Pro Val ser He Gly Asn Gly Gin Gly val Phe Glu 
40 45 50 

Arg ser val Phe Pro Gly Asn Asp ser Ala Phe val Arg Gly Thr ser 
33 60 65 

Asn Phe Thr Leu Thr Asn Leu Val Ser Arg Tyr Asn Thr Gly Gly Tyr 
70 75 80 ' 85 

Ala Thr val ser Gly ser ser Gin Ala Ala lie Gly ser Gin He Cys 
90 95 100 

Arg ser Gly ser Thr Thr Gly Trp His Cys Gly Thr val Gin Ala Arg 
105 110 H5 a 

Gly Gin Thr Val ser Tyr Pro Gin Gly Thr val Gin Asn Leu Thr Arg 
120 125 130 M 

Thr Asn val Cys Ala Glu Pro Gly Asp Ser Gly Gly ser Phe lie ser 
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Gly Ser Gin Ala Gin Gly Val Thr ser Gly Gly ser Gly Asn cys Sen 

155 160 165 

Phe Gly Gly Thr Thr Tyr Tyr Gin Glu val Asn Pro Met Leu ser ser 
170 175 180 

Trp Gly Leu Thr Leu Arg Thr 
185 

<210> 34 
<211> 43 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1421 
<400> 34 

gttcatcgat cgcatcggct gcgaccggcc ccctccccca gtc 43 

<210> 35 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1604 

<400> 35 

gcggatccta tcaggtgcgc agggtcagac c 31 

<210> 36 
<211> 1062 
<212> DNA 

<213> Nocardiopsis prasina DSM 15648 
<400> 36 

gccaccggac cgctccccca gtcacccacc ccggaggccg acgccgtctc catgcaggag 60 

gcgctccagc gcgacctcgg cctgaccccg cttgaggccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgcggccg ccggggacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgacgc cgcctcggtc 240 

gaggctgtgg aggccaccgg cgcgggtacc gaactcgtct cctacggcat cgagggcctc 300 

gacgagatca tccaggatct caacgccgcc gacgccgtcc ccggcgtggt cggctggtac 360 

ccggacgtgg cgggtgacac cgtcgtcctg gaggtcctgg agggttccgg agccgacgtg 420 

agcggcctgc tcgccgacgc cggcgtggac gcctcggccg tcgaggtgac cagcagtgcg 480 

cagcccgagc tctacgccga catcatcggc ggtctggcct acaccatggg cggccgctgt 540 

tcggtcggat tcgcggccac caacgccgcc ggtcagcccg gattcgtcac cgccggtcac 600 

tgtggccgcg tgggcaccca ggtgagcatc ggcaacggcc agggcgtctt cgagcagtcc 660 

atcttcccgg gcaacgacgc cgccttcgtc cgcggcacgt ccaacttcac gctgaccaac 720 
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ctggtcagcc gctacaacac cggcggttac gccaccgtcg ccggccacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggct ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac ccggaccacc 900 

gtgtgcgccg agcccggcga ctccggcggc tcctacatct ccggcaacca ggcccagggc 960 

gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 

acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 1062 

<210> 37 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina dsm 15648 
<220> 

<221> PROPEP 

<222> (1) . . (165) 

<220> 

<221> mat_pepti de 
<222> (166)..C353) 

<400> 37 

Ala Thr Gly pro Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala 
-165 -160 -155 

Val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
-150 -145 -140 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
-135 -130 -125 

V ?™ Asp Glu Ala Ala Ala Ala Ala Ala G *!y Asp Ala Tyr Gly Gly 
-120 -115 _ 110 * * 

ser val Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp 

~ 105 -100 -95 _g5 

Ala Ala Ser Val Glu Ala val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
-85 -80 -75 

Val ser Tyr Gly He Glu Gly Leu Asp Glu lie He Gin Asp Leu Asn 
-70 -65 -60 

Ala Ala Asp Ala val Pro Gly val val Gly Trp Tyr Pro Asp Val Ala 
-55 -50 _45 

Gly Asp Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val 
-40 -35 -30 

Ser Gly Leu Leu Ala Asp Ala Gly val Asp Ala Ser Ala val Glu val 
"25 -20 -15 _ 10 
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Thr ser ser Ala Gin Pro Glu Leu Tyr Ala Asp lie He Gly Gly Leu 

Ala Tyr Thr Met Gly Gly Arg Cys Ser val Gly Phe Ala Ala Thr Asn 
10 15 20 

Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg val 

Gly Thr Gin Val ser lie Gly Asn Gly Gin Gly Val Phe Glu Gin Ser 
4U 45 50 55 

lie Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 
60 65 70 

Thr Leu Thr Asn Leu val ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 85 

Val Ala Gly His Asn Gin Ala Pro lie Gly ser ser Val Cys Arq ser 
90 95 100 

Gly ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin . 

ser val Ser Tyr pro Glu Gly Thr val Thr Asn Met Thr Arg Thr Thr 
120 12 5 130 135 

Val cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr lie ser Gly Asn 
140 145 15& 

Gin Ala Gin Gly^ val Thr ser Gly Gly ser Gly Asn cys Arg Thr Gly 

Gly Thr Thr Phe Tyr Gin Glu Val Thr Pro Met val Asn ser Trp Gly 
170 175 180 

val Arg Leu Arg Thr 
185 

<210> 38 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1346 

<400> 38 

gttcatcgat cgcatcggct gccaccggac cgctccccca gtc 43 



<210> 39 
<211> 38 
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PCT/DK2004/000436 

<212> DNA 10495.204-WO.ST25.txt 
<213> Artificial sequence 

<220> 

<223> Primer 1602 
<400> 39 

gcggatccta ttaggtccgg agacggacgc cccaggag 



<210> 40 

<211> 1062 

<212> DNA 

<213> Nocardiopsis prasina dsm 15649 

<400> 40 



38 



<210> 41 

<211> 353 

<212> PRT 

<213> Nocardiopsis prasina dsm 15649 
<220> 

<221> PROPEP 

<222> (1) . . (165) 

<220> 

<221> mat_peptide 

<222> C166)..C353) 

<400> 41 



60 



gccaccggac cactccccca gtcacccacc ccggaggccg acgccgtctc catgcaggag 

gcgctccagc gcgacctcgg cctgaccccg cttgaggccg atgaactgct ggccgcccag 120 

gacaccgcct tcgaggtcga cgaggccgcg gccgaggccg ccggtgacgc ctacggcggc 180 

tccgtcttcg acaccgagac cctggaactg accgtcctgg tcaccgactc cgccgcggtc 240 

gaggcggtgg aggccaccgg cgccgggacc gaactggtct cctacggcat cacgggcctc 300 

gacgagatcg tcgaggagct caacgccgcc gacgccgttc ccggcgtggt cggctggtac 360 

ccggacgtcg cgggtgacac cgtcgtgctg gaggtcctgg agggttccgg cgccgacgtg 420 

ggcggcctgc tcgccgacgc cggcgtggac gcctcggcgg tcgaggtgac caccaccgag 480 

cagcccgagc tgtacgccga catcatcggc ggtctggcct acaccatggg cggccgctgt 540 

tcggtcggct tcgcggccac caacgccgcc ggtcagcccg ggttcgtcac cgccggtcac 600 

tgtggccgcg tgggcaccca ggtgaccatc ggcaacggcc ggggcgtctt cgagcagtcc 660 

atcttcccgg gcaacgacgc cgccttcgtc cgcggaacgt ccaacttcac gctgaccaac 720 

ctggtcagcc gctacaacac cggcggctac gccaccgtcg ccggtcacaa ccaggcgccc 780 

atcggctcct ccgtctgccg ctccggctcc accaccggtt ggcactgcgg caccatccag 840 

gcccgcggcc agtcggtgag ctaccccgag ggcaccgtca ccaacatgac gcggaccacc 900 

gtgtgcgccg agcccggcga ctccggcggc tcctacatct ccggcaacca ggcccagggc 960 

gtcacctccg gcggctccgg caactgccgc accggcggga ccaccttcta ccaggaggtc 1020 
acccccatgg tgaactcctg gggcgtccgt ctccggacct aa 



1062 
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Ala Thr Gly Pro Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala 
j-oj -XoU -155 

Val ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro 
A3U -145 -140 

Leu Glu Ala Asp Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 
J - DD -liu -125 

-i2o Asp G1u Ala Ala -lis Glu Ala Ala Gly A n 0 Ala J y r G ">y Giy 

ser val Phe Asp Thr Glu Thr Leu Glu Leu Thr val Leu Val Thr Asp 
-L"3 -100 -95 _ 9 g 

ser Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
~ 85 -80 -75 

val ser Tyr Gly He Thr Gly Leu Asp Glu He Val Glu Glu Leu Asn 
~ /u -65 -60 

Ala Ala Asp Ala Val Pro Gly Val Val Gly Trp Tyr Pro Asp Val Ala 
"■>•> -50 -45 

Gly Asg Thr val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val 

Gly Gly Leu Leu Ala Asp Ala Gly val Asp Ala ser Ala val Glu val 

-20 -15 -10 

Thr Thr Thr Glu Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 

Ala Tyr Thr Met Gly Gly Arg cys ser Val Gly Phe Ala Ala Thr Asn 

Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arg val 

Gly Thr Gin val Thr lie Gly Asn Gly Arg Gly val Phe Glu Gin Ser 
w 45 50 55 

He Phe Pro Gly Asn Asp Ala Ala Phe val Arg Gly Thr Ser Asn Phe 
t>u 65 70 

Thr Leu Thr Asn Leu val ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
'•> 80 85 

Val Ala Gly His Asn Gin Ala Pro He Gly ser ser Val Cys Arg ser 
90 95 100 
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Gly Ser Thr Thr Gly Trp His cys Gly Thr He Gin Ala Arg Gly Gin 

Sen val ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 

A " 130 135 

val cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr He Ser Gly Asn 
±w 145 155 

Gin Ala Gin Gly val Thr ser Gly Gly ser Gly Asn cys Arg Thr Gly 

Gly Thr Thr Phe Tyr Gin Glu val Thr Pro Met Val Asn ser Trp Gly 
A/u 175 180 y 

Val Arg Leu Arg Thr 
185 

<210> 42 

<211> 43 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Primer 1603 

<400> 42 

gttcatcgat cgcatcggct gccaccggac cactccccca gtc 

<210> 43 

<211> 353 

<212> prt 

<213> Nocardiopsis sp. nrrl 18262 
<220> 

<221> PROPEP 

<222> (1) . . (165) 

<220> 

<221> mat_peptide 
<222> (166).. (1059) 

<400> 43 

-165 Gly Ala L6U :!% 0 Gln Ser p ™ Thr P_ro 5 Glu Ala Asp Ala 

yal o ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser 

-j-HD -140 

Al| s Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu 

13U -125 

-120 ASP GlU Ala Ala A ]U G1U Ala Ala G ^ Asp Ala Tyr Gly Gly 
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s ?£r- Va1 Phe As P Thr Glu Ser Leu Glu Leu Thr val Leu val Thr Asp 
-105 -100 -95 -90 

Ala Ala Ala Val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Glu Leu 
"85 -80 -75 

Val ser Tyr Gly He Asp Gly Leu Asp Glu lie val Gin Glu Leu Asn 
"70 -65 -60 

Ala Ala Asp Ala val Pro Gly Val Val Gly Trp Tyr Pro Asp val Ala 
-55 -50 -45 

Gly Asp Thr val val Leu Glu Val Leu Glu Gly ser Gly Ala Asp val 
-40 -35 -30 

ser Gly Leu Leu Ala Asp Ala Gly Val Asp Ala ser Ala Val Glu val 
- 25 -20 -15 -10 

Thr Thr Ser Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu 
-5 -11 5 

Ala Tyr Thr Met Gly Gly Arg Cys Ser Val Gly phe Ala Ala Thr Asn 
10 15 20 

Ala Ala Gly Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Arq val 
25 30 35 

Gly Thr Gin val Thr lie Gly Asn Gly Arg Gly val Phe Glu Gin ser 
40 45 50 55 

Val Phe Pro Gly Asn Asp Ala Ala Phe Val Arg Gly Thr Ser Asn Phe 
60 65 70 

Thr Leu Thr Asn Leu val ser Arg Tyr Asn Thr Gly Gly Tyr Ala Thr 
75 80 85 

val Ala Gly His Asn Gin Ala Pro lie Gly ser Ser val cys Arq ser 
90 95 100 

Gly ser Thr Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Gly Gin 
105 110 115 

ser val Ser Tyr Pro Glu Gly Thr Val Thr Asn Met Thr Arg Thr Thr 
120 125 130 135 

Val cys Ala Glu Pro Gly Asp ser Gly Gly ser Tyr lie Ser Gly Thr 
140 145 150 

Gin Ala Gin Gly Val Thr Ser Gly Gly ser Gly Asn Cys Arg Thr Gly 
155 160 165 
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Gly Thr Thr Phe Tyr Gin Glu val Thr pro Met val Asn Ser Trp Gly 
170 175 180 

val Arg Leu Arg Thr 
185 

<210> 44 

<211> 1164 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic protease encoding gene 
<220> 

<221> CDS 

<222> (1)..(1164) 

<223> Full length protease 

<220> 

<221> sig_peptide 
<222> (15.. (81) 

<220> 

<221> mi sc_f eatu re 

<222> (82) . . (1164) 

<223> Propeptide 

<220> 

<221> mat_peptide 
<222> (577).. (1164) 

<400> 44 

atg aaa aaa ccg ctg gga aaa att gtc gca age aca aca ctt ctt ai 
Met Lys Lys Pro Leu G Ty Lys He val Ala sir Th? ?la EeJ ES 
-190 -185 -180 

att tea gtg gca ttt age tea tct att gca tea aca act aca aaa on 
He ser VaT Ala Phe ser ser ser He Ala seF Ala Ala Thr G?y 90 
~ 175 -170 -165 

gca tta ccg cag tct ccg aca ccg gaa gca gat aca ate tea atn 
Ala Leu Pro Gin ser Pro Thr Pro hu hi Xip All val ler Set 35 
_lb0 -155 -150 

rln r?M ?i a P tg S aa aga gat ctt aat ctt aca tca gca gaa gca 180 
Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu Ala 
-145 -140 _135 

rin ?? a ctt ? ct gca caa gat aca aca "t 9 aa gtg gat gaa 225 

Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu VaT Asp Glu 
--"U -125 -120 

Ail Ail In 3 ^? a ??? f? a ?? a gat gca tat g? c gg c *ca g tt ttt 270 

Aia Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly ser val Phe 
"115 -110 _ 105 

gat aca gaa tca ctt gaa ctt aca gtt ctt gtt aca aat aca aca aca *ir 
Asp Thr Glu ser Leu Glu Leu Thr val Leu val fB £p SlS XK JS 318 
-J-UU -95 -go 

Si r?M ?i a 85? 2 aa ? ca ^£ a gga gca g 9 a aca g ta ctt gtt tea tat 366 
val Glu Ala val Glu Ala Thr Gly Ala Gly Thr Val Leu val ser Tyr 
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aj sr ss «? si ss as sr «a % h- a as ss ss ss 414 

»•> -60 -55 

SS «Ef S3 B5 8H 83 815 tS? # SS JBJ §3 SS §?? «* SS < 62 

_:>0 -45 _ 4 5 

«H 8f SI S! fif £2 §S IS g fg ffl» a* g g?; a 

33 -30 -25 

a ss § ss is «f ss ?g g «- ?« g? = ^ - a fi? g 558 

SS a! SS K SI Si SS SS IB SB SS SS SS SS SS Si 606 

5 10 

53 m 95 SS §J SS Si SS Si SS SB ?g « jr g ^ 6*4 

xd 20 25 

SS SS H5 jg IS S? JR fg SS & g?j ?s? §3 1 ?5 3- - 7o 2 



25 

t tgc ggc aca gtt ggc 
s cys Gly Thr val Gly 
35 40 

«S g lie H5 SS SS K § 83 SS SS 25 IS IB £ SS 750 



55 



m ss ss g ss si © ss is s? g ss as ss si ss ™ 
ss a g? g ss gj as g §s si jk ?ss g. a* 

85 go 

SB ss as SS SS IB H$ g ss si? ss SS g B5 s ss 894 

yD 100 10 5 

SS m IS SB SS gg SS IS » SS SS SS SS SS S3 ss 942 

115 120 y 

Si SS SB IS SS S3 SI IS ss ss ss ss SS S3 si ss 990 

ss ss m ss g gg gj g si a* -g ^ ? « ^ c„ xo 38 

ffl S3 SS SS gS 1|5 g m ss ss Iff SS SS SS SS SS 1086 

s? s? as is ss ss ss ss si ts ss ss as si ss si 1134 

180 185 
aga aca caa teg cat gtt caa tec art rra 

Arg Thr Gin ser His val Gin sir Ala SS 1164 
190 195 
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<210> 45 
<211> 388 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> synthetic construct 
<400> 45 

Met Lys Lys Pro Leu Gly Lys He val Ala ser Thr Ala Leu Leu 
- 190 -185 -180 

He ser val Ala Phe Ser ser ser He Ala ser Ala Ala Thr Gly 
-175 -170 -165 

Ala Leu Pro Gin ser pro Thr Pro Glu Ala Asp Ala val ser Met 
-160 -155 _ 150 

Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu Ala 
-145 -140 _ 135 

Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 
-130 -125 -120 

Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly ser val Phe 
-lis -110 * -105 

Asp Thr Glu ser Leu Glu Leu Thr Val Leu Val Thr Asp Ala Ala Ala 
-luo -95 -go 

val Glu Ala Val Glu Ala Thr Gly Ala Gly Thr Val Leu val ser Tyr 
-oo -80 -75 

Gly He Asp Gly Leu Asp Glu He val Gin Glu Leu Asn Ala Ala Asp 
/u -65 -60 -55 

Ala val Pro Gly val val Gly Trp Tyr Pro Asp Val Ala Gly Asp Thr 
-50 -45 -40 

Val val Leu Glu val Leu Glu Gly Ser Gly Ala Asp val ser Gly Leu 
-35 -30 _25 

Leu Ala Asp Ala Gly val Asp Ala Ser Ala val Glu val Thr Thr ser 

-15 -10 

Asp Gin Pro Glu Leu Tyr Ala Asp lie lie Gly Gly Leu Ala Tyr Tyr 

-11 5 10 

Met Gly Gly Arg cys Ser val Gly Phe Ala Ala Thr Asn Ala ser Gly 
15 20 25 
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Gin Pro Gly Phe Val Thr Ala Gly His Cys Gly Thr Val Gly Thr Pro 
30 35 40 

Val Ser lie Gly Asn Gly Lys Gly val Phe Glu Arg ser lie Phe Pro 
45 50 55 

Gly Asn Asp ser Ala Phe val Arg Gly Thr Ser Asn Phe Thr Leu Thr 
60 65 70 

Asn Leu val ser Arg Tyr Asn ser Gly Gly Tyr Ala Thr val Ala Gly 
75 80 85 90 

His Asn Gin Ala Pro lie Gly ser Ala val cys Arg ser Gly ser Thr 
95 100 105 

Thr Gly Trp His Cys Gly Thr He Gin Ala Arg Asn Gin Thr Val Arg 
110 115 120 

Tyr Pro Gin Gly Thr Val Tyr Ser Leu Thr Arg Thr Thr val cys Ala 
125 130 135 

Glu Pro Gly Asp ser Gly Gly ser Tyr lie Ser Gly Thr Gin Ala Gin 
140 145 150 

Gly Val Thr ser Gly Gly Ser Gly Asn cys Ser Ala Gly Gly Thr Thr 
I 55 160 165 170 

Tyr Tyr Gin Glu val Asn Pro Met Leu Ser Ser Trp Gly Leu Thr Leu 
175 180 185 

Arg Thr Gin ser His val Gin Ser Ala Pro 
190 195 

<210> 46 
<211> 165 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide 0-2.19 
<220> 

<221> PROPEP 
<222> CD • - (165) 

<400> 46 

Ala Thr Gly Ala Leu Pro Gin Ser pro Thr Pro Glu Ala Asp Ala val 
15 10 15 

ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr Ser Ala Glu 
20 25 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 

Page 30 



WO 2004/111224 PCT/DK2004/000436 

A 10495.204-WO.ST25.txt 
« 40 45 

Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly ser Val Phe Asp 
JU jj 60 

Thr Glu Ser Leu Thr Leu Thr val Leu val Thr Asp Ala Ser Ala Val 
D:> 70 75 80 

Glu Ala Val Glu Ala Ala Gly Ala Glu Ala Lys val val ser His Glv 
85 go 95 

Met Glu Gly Leu Glu Glu He val Ala Asp Leu Asn Ala Ala Asp Ala 
i-UU 105 110 

Gin Pro Gly val val Gly Trp Tyr Pro Asp lie His Ser Asp Thr Val 

120 125 

val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Asp ser Leu Leu 
Xd0 135 140 

Ala Asp Ala Gly val Asp Ala Ser Ala Val Glu Val Thr Thr ser Asp 



160 



Gin Pro Glu Leu Tyr 
165 

<210> 47 
<211> 166 
<212> prt 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-2.73 
<220> 

<221> PROPEP 
<222> (1) . . (166) 

<400> 47 

Ala Thr Gly Ala Leu Pro Gin ser Pro Thr pro Glu Ala Asp Ala val 

10 15 

ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Ser ser Ala Glu 
^° 25 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 
35 40 45 

Ala Ala Ala Gly Ala Ala Gly Asp Ala Tyr Gly Gly ser Val Phe Asp 
3U 55 60 

Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp Ala ser Ala Val 

70 75 80 
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Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys Val Val Ser His Gly 
85 90 95 

Met Glu Gly Leu Glu Glu lie Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 

Gin Pro Gly val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 
115 120 125 

val val Glu val Leu Glu Gly ser Gly Ala Asp val Asp ser Leu Leu 
130 135 140 

Ala Asp Ala Gly val Asp Thr Ala Asp val Lys Val Glu ser Thr Thr 

150 155 160 

Glu Gin Pro Glu Leu Tyr 
165 

<210> 48 
<211> 166 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-1.43 
<220> 

<221> PROPEP 
<222> (D.-C166) 

<400> 48 

Ala Thr Gly Ala Leu Pro Gin Ser Pro Thr Pro Glu Ala Asp Ala Val 
x 5 10 15 

Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser Ser ser Gin 
zo 25 30 

Ala Glu Glu Leu Leu Asp Ala Gin Ala Glu ser Phe Glu He Asp Glu 
35 40 45 

Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly ser lie Phe Asp 
DU 55 60 

Thr Asp ser Leu Thr Leu Thr val Leu val Thr Asp Ala ser Ala val 
DD 70 75 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val Val Ser His Gly 
85 90 95 J 

Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 HO 
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Gin Pro Gly val val Gly Trp Tyr Pro Asp He His Ser Asp Thr val 

120 125 

val Leu Glu val Leu Glu Gly ser Gly Ala Asp Val Asp ser Leu Leu 

135 140 

Ala Asp Ala Gly val Ajg Thr Ala Asp val Lys val Glu ser Thr Thr 



155 160 



Glu Gin Pro Glu Leu Tyr 
165 



<210> 49 

<211> 166 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-2.6 

<400> 49 



Ala Thr Gly Ala Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala Val 
5 10 15 

ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu 
£V 25 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 
s-> 40 45 

Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly ser He Phe Asp 

35 60 

Thr Glu Thr Leu Glu Leu Thr Val Leu Val Thr Asp ser ser ser Val 

/u 75 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val val ser His Gly 
85 90 95 ' 

Met Glu Gly Leu Glu Glu He val Ala Asp Leu Asn Ala Ala Asp Ala 
J.uu 105 2.10 

Gin Pro Gly val val Gly Trp Tyr Pro Asp He His ser Asp Thr Val 
J " i:> 120 125 

val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Asp ser Leu Leu 
" v 13 3 140 

Ala Gly Ala Gly val Asp Thr Ala Asp Val Lys val Glu ser Thr Thr 

15° 155 160 

Glu Gin pro Glu Leu Tyr 
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165 

<210> 50 
<211> 165 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-2.5 
<220> 

<221> PROPEP 
<222> (1) . . (165) 

<400> 50 

Ala Thr Gly Ala Leu Pro Gin ser pro Thr pro Glu Ala Asp Ala val 
5 10 15 

ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro Leu Glu 
^° 25 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val Asp Glu 
" 40 45 

Ala Ala Ala Glu Ala Ala Gly Asp Ala Tyr Gly Gly ser Val Phe Asp 
-> u 55 60 

Thr Glu Thr Leu Glu Leu Thr val Leu val Thr Asp Ala Ser Ala val 

70 75 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys Val val Ser His Glv 
85 90 95 

Met Glu Gly Leu Glu Glu lie val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 110 

Gin Pro Gly Val Val Gly Trp Tyr Pro Asp lie His Ser Asp Thr val 
115 120 125 

val Leu Glu val Leu Glu Gly Ser Gly Ala Asp val Asp ser Leu Leu 
130 135 140 

Ala Asp Ala Gly val Asp Ala Ser Ala Val Glu val Thr Pro Ala Ala 
±n 150 155 160 

Arg Pro Glu Leu Tyr 
165 

<210> 51 
<211> 166 
<212> PRT 

<213> Artificial sequence 
<220> 
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<220> 

<221> PROPEP 
<222> CI) . . (166) 

<400> 51 

Ala Thr Gly Ala Leu Pro Gin ser Pro Thr Pro Asp Gly Ala Glu Ala 

10 15 

Thr Thr Met Val Glu Ala Leu Gin Arg Asp Leu Gly Leu Thr Pro Ala 
m 25 30 

Glu Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu Val asp 
35 40 45 K 

Glu Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly ser lie Phe 
■ >u 55 60 

Asp Thr Asp ser Leu Thr Leu Thr val Leu val Thr Asp Ala Ala Ala 

/0 75 80 

val Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val val ser His 
85 90 " 95 

Gly Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp 
■lvv 105 HO 

Ala val Pro Gly val Val Gly Trp Tyr Pro Asp val Ala Gly Asp Thr 

120 125 

val val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Tyr ser Leu 
Aau 135 140 

Leu Ala Asp Ala Gly val Asp Ala ser Ala val Glu val Thr Pro Ala 

150 155 160 

Ala Gin Pro Glu Leu Tyr 
165 

<210> 52 
<211> 166 
<212> prt 

<213> Artificial sequence 
<220> 

<223> Shuffled propeptide G-1.4 
<220> 

<221> propep 
<222> CD . . C166) 

<400> 52 
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Ala Thr Gly Ala Leu Pro Gin ser° Pro' Thr *p ro^Gl u' All Asp Ala val 
15 10 15 

ser Met Gin Glu Ala Leu Gin Arg Asp Leu Gly Leu Ser Ser Ser Gin 
^ u 25 30 

Ala Glu Glu Leu Leu Asp Ala Gin Ala Glu ser phe Glu lie Asp Glu 
3i 40 45 

Ala Ala Ala Ala Ala Ala Ala Asp Ser Tyr Gly Gly ser He Phe Asp 
3U 55 60 

Thr Asp ser Leu Thr Leu Thr Val Leu Val Thr Asp Ala ser Ala val 
3 70 75 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val val ser His Glv 
85 90 95 3 

Met Glu Gly Leu Glu Glu He val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 HO 

Gin pro Gly val Val Gly Trp Tyr Pro Asp He His ser Asp Thr Val 
115 120 125 

val Leu Glu val Leu Glu Gly Ser Gly Ala Asp val Asp ser Leu Leu 
" u 135 140 

Ala Asp Ala Gly val Asp Thr Ala Asp Val Lys Val Glu ser Thr Thr 
3 150 155 160 



Glu Gin Pro Glu Leu Tyr 
165 



<210> 53 

<211> 166 

<212> prt 

<213> Artificial sequence 
<220> 

<223> shuffled propeptide G-1.2 
<220> 

<221> PROPEP 

<222> (1) . . (166) 

<400> 53 



Ala Thr Gly Ala Leu Pro Gin ser Pro Thr Pro Glu Ala Asp Ala val 
1 5 10 15 

Ser Met Gin Glu Ala Leu Gin Arg Asp Leu Asp Leu Thr ser Ala Glu 
10 25 30 

Ala Glu Glu Leu Leu Ala Ala Gin Asp Thr Ala Phe Glu val Asp Glu 
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35 40 45 

Ala Ala Ala Ala Ala Ala Gly Asp Ala Tyr Gly Gly Ser He Phe Asp 

60 

Thr Glu Thr Leu Glu Leu Thr Val Leu val Thr Asp Ser Ser Ser val 

70 75 80 

Glu Ala val Glu Ala Ala Gly Ala Glu Ala Lys val val ser His Gly 
85 90 95 

Met Glu Gly Leu Glu Glu He Val Ala Asp Leu Asn Ala Ala Asp Ala 
100 105 HO 

Gin Pro Gly Val Val Gly Trp Tyr Pro Asp He His Ser Asp Thr Val 

120 125 

val Leu Glu val Leu Glu Gly ser Gly Ala Asp val Asp ser Leu Leu 

Ala Gly Ala Gly val Asp Thr Ala Asp Val Lys val Glu Ser Thr Thr 
3 150 155 160 

Glu Gin Pro Glu Leu Tyr 
165 
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